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Abstract. In this work we completely characterize how the frequency with which each player par- 
ticipates in the game dynamics affects the possibility of reaching efficient states, i.e., states with an 
approximation ratio within a constant factor from the price of anarchy, within a polynomially bounded 
number of best responses. We focus on the well known class of congestion games and we show that, 
if each player is allowed to play at least once and at most /3 times any T best responses, states with 
approximation ratio 0(/3) times the price of anarchy are reached after T [log log n] best responses, and 
that such a bound is essentially tight also after exponentially many ones. One important consequence 
of our result is that the fairness among players is a necessary and sufficient condition for guaranteeing 
a fast convergence to efficient states. This answers the important question of the maximum order of /3 
needed to fast obtain efficient states, left open by [9I10| and [3], in which fast convergence for constant 
/3 and very slow convergence for /3 = 0{n) have been shown, respectively. Finally, we show that the 
structure of the game implicitly affects its performances. In particular, we show that in the symmetric 
setting, in which all players share the same set of strategies, the game always converges to an efficient 
state after a polynomial number of best responses, regardless of the frequency each player moves with. 

Keywords: Congestion Games, Speed of Convergence, Best Response Dynamics. 

1 Introduction 

The class of congestion games is a well established approach for modelling any non-cooperative system in 
which a set of resources are shared among a set of selfish players. In a congestion game we have a set of 
m resources and a set of n players. Each player's strategy consists of a subset of resources. The delay of a 
particular resource e depends on its congestion, corresponding to the number of players choosing e, and the 
cost of each player i is the sum of the delays associated with the resources selected by z. In this work we focus 
on linear congestion games where the delays are linear functions. A congestion game is called symmetric if 
all players share the same strategy set. A state of the game is any combination of strategies for the players 
and its social cost, defined as the sum of the players' costs, denotes its quality from a global perspective. 
The social optimum denotes the minimum possible social cost among all the states of the game. 

Related work. Rosenthal has shown, by a potential function argument, that the natural decentral- 
ized mechanism known as Nash dynamics consisting in a sequence of moves in which at each one some player 
switches its strategy to a better alternative, is guaranteed to converge to a pure Nash equilibrium |12| . 

In order to measure the degradation of social welfare due to the selfish behavior of the players, Koutsoupias 
and Papadimitriou [TT^ defined the price of anarchy as the worst-case ratio between the social cost in a Nash 
equilibrium and that of a social optimum. The price of anarchy for congestion games has been investigated 
by Awerbuch et al. 2_ and Christodoulou and Koutsoupias [5^ . They both proved that the price of anarchy 
for congestion games with linear delays is 5/2. 

* This research was partially supported by the grant NRF-RF2009-08 "Algorithmic aspects of coalitional games" 
and by the PRIN 2008 research project COGENT (Computational and GamE-theoretic aspects of uncoordinated 
NeTworks), funded by the Italian Ministry of University and Research. 



The existence of a potential function relates the class of congestion games to the class of polynomial local 
search problems (PLS) [7|. Fabrikant et al. [8] proved that, even for symmetric congestion games, the problem 
of computing Nash equilibria is PLS-complete [7] . One major consequence of the completeness result is the 
existence of congestion games with initial states such that any improvement sequence starting from these 
states needs an exponential number of steps in the number of players n in order to reach a Nash equilibrium. 
A recent result by Ackermann et al. [T] shows that the previous negative result holds even in the restricted 
case of congestion games with linear delay functions. 

The negative results on computing equilibria in congestion games have lead to the development of the 
concept of e-Nash equilibrium, in which no player can decrease its cost by a factor of more than e. Un- 
fortunately, as showed by Skopalik and Vocking [14 , also the problem of finding an e-Nash equilibrium in 
congestion games is PLS-complete for any e, though, under some restrictions on the delay functions, Chien 
and Sinclair [3] proved that in symmetric congestion games the convergence to e-Nash equilibrium is poly- 
nomial in the description of the game and the minimal number of steps within each player has chance to 
move. 

Since negative results tend to dominate the issues relative to equilibria, another natural arising question 
is whether efficient states (with a social cost comparable to the one of any Nash equilibrium) can be reached 
by best response moves in a reasonable amount of time (see |3I6I9I10| V We measure the efficiency of a state 
by the ratio among its cost and the optimal one, and we refers to it as the approximation ratio of the state. 
We generally say that a state is efficient when its approximation ratio is within a constant factor from the 
price of anarchy. Since the price of anarchy of linear congestion games is known to be constant [2 5, efficient 
states approximate the social optimum by a constant factor. It is worth noticing that in the worse case, a 
generic Nash dynamics starting from an arbitrary state could never reach a state with an approximation 
ratio lower than the price of anarchy. Furthermore, by a potential function argument it is easy to show that 
in a linear congestion game, once a state S with a social cost C{S) is reached, even if such a state is not a 
Nash equilibrium, we are guaranteed that for any subsequent state S' of the dynamics, C{S') = 0{C{S)). 
Awerbuch et al. [3] have proved that for linear congestion games, sequences of moves reducing the cost 
of each player by at least a factor of e, converge to efficient states in a number of moves polynomial in 
1/e and the number of players, under the minimal liveness condition that every player moves at least once 
every a polynomial number of moves. Under the same liveness condition, they also proved that exact best 
response dynamics can guarantee the convergence to efficient states only after an exponential number of 
best responses [3]. Nevertheless, Fanelli et al. have shown that, under more restrictive condition that each 
player plays exactly once every n best responses, any best response dynamics converges to an efficient state 
after 0(rt log log n) best responses. Subsequently, Fanelli and Moscardelli [10] extended the previous results 
to the more general case in which each player plays a constant number of times every 0{n) best responses. 

Our Contribution. In this work we completely characterize how the frequency with which each player 
participates in the game dynamics affects the possibility of reaching efficient states. In particular, we close 
the most important open problem raised by [3] and |9|10j for linear congestion games. On the one hand, in [3] 
it is shown that, even after an exponential number of best responses, states with a very high approximation 



ratio, namely f] (^i^J^jj '^^^ t)e reached. On the other hand, in [9|10j it is shown that, under the minimal 
liveness condition in which every player moves at least once every T steps, if players perform best responses 
such that each player is allowed to play at most /3 = 0(1) times any T steps (notice that /3 = 0{1) implies 
T = 0{n)), after Tfloglogn] best responses a state with a constant factor approximation ratio is reached. 

The more /3 increases, the less the dynamics is fair with respect to the chance every player has of 
performing a best response: /? measures the degree of unfairness of the dynamics. The important left open 
question was that of determining the maximum order of /3 needed to obtain fast convergence to efficient 
states: We answer this question by proving that, after T [log log n] best responses, the dynamics reaches 
states with an approximation ratio of 0{/3). Such a result is essentially tight since we are also able to show 
that, for any e > 0, there exist congestion games for which, even for an exponential number of best responses, 
states with an approximation ratio of i7(/3^~^) are obtained. Therefore, /3 constant as assumed in |9|10j is 
not only sufficient, but also necessary in order to reach efficient states after a polynomial number of best 
responses. 
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Finally, in the special case of symmetric congestion games, we show that the unfairness in best response 
dynamics does not affect the fast convergence to efficient states; namely, we prove that, for any /3, after 
T [log log n] best responses efficient states are always reached. 

The paper is organized as follows: In the next section provide the basic notation and definitions. Section 
[3] is devoted to the study of asymmetric congestion games, while Section |4] to the analysis of the symmetric 
case. Finally, Section [5] extends the results to more general settings and gives some conclusive remarks. 

2 Model and Definitions 

A congestion game Q = {N,E, {Si)iizjq, {ff,)e^E, {ci)i(^N) is a non-cooperative strategic game characterized 
by the existence of a set E of resources to be shared by n players in TV = {1, . . . , n}. 

Any strategy Si G Si of player z e is a subset of resources, i.e.. Si C 2^ . A congestion game is called 
symmetric if all players share the same set of strategies, i.e., S = Si for every i & N . Given a state or 
strategy profile S = (si, . . . , s„) and a resource e, the number of players using e in S, called the congestion 
on e, is denoted by ne(S') = |{i G A | e e Si}\. A delay function /e : N i-> M+ associates to resource e a delay 
depending on the number of players currently using e, so that the cost of player i for the pure strategy Si is 
given by the sum of the delays associated with the resources in s^, i.e., Ci{S) = X^eGs feii^eiS))- 

In this paper we will focus on linear congestion games, that is having linear delay functions with nonneg- 
ative coefficients. More precisely, for every resource e € E, fe{x) = agX + be for every resource e ^ E, with 
ae,he > 0. 

Given the strategy profile S — (si,...,s„), the social cost C{S) of a given state S is defined as the 
sum of all the players' costs, i.e., C{S) = J2ieN Ci(<5'). An optimal strategy profile S* = {si, ... , s* ) is one 
having minimum social cost; we denote C{S*) by Opt. The approximation ratio of state S is given by the 
ratio between the social cost of S and the social optimum, i.e., Moreover, given the strategy profile 

S = {si,S2, . ■ . , Sn) and a strategy s- S Si, let (5-^, s-) = (si, S2, . . . , Sj_i, s-, s^+i, . . . , s„), i.e., the strategy 
profile obtained from S if player i changes its strategy from Si to s'^. 

The potential function is defined as ^{S) = X^es-E feU)- It is call exact potential function since 

it satisfies the property that for each player i and each strategy s'^ G Si of i in S, it holds that Ci{S-i, ~ 
Ci{S) = <l>{S-i,s^) — <1>{S). It is worth noticing that in linear congestion games, for any state S, it holds 
^{S) < C{S) < 2^{S). 

Each player acts selfishly and aims at choosing the strategy decreasing its cost, given the strategy choices 
of other players. A best response of player i in 5 is a strategy s\ G Si yielding the minimum possible cost, 
given the strategy choices of the other players, i.e., Ci{S-i,s\) < Ci{S-i,s[) for any other strategy s- e Si. 
Moreover, if no s'^ e Si is such that Ci{S-i,s'j) < Ci{S), the best response of i in S* is Si. We call a best 
response dynamics any sequence of best responses. 

Given a best response dynamics starting from an arbitrary state, we are interested in the social cost of 
its final state. To this aim, we must consider dynamics in which each player moves at least once every a 
given number T of best responses, otherwise one or more players could be "locked out" for arbitrarily long 
and we could not expect to bound the social cost of the state reached at the end of the dynamics. Therefore, 
we define a T- covering as a dynamics of T consecutive best responses in which each player moves at least 
once. More precisely, a T-covering R — (S*^, . . . , S*^) is composed by T best responses; 5^ is said to be the 
initial state of R and S]^ is its final state. For every 1 < t < T, let 7rfl(i) be the player performing the t-th 
best response of R; tt^ is such that every player performs at least a best response in R. In particular, for 

every I <t<T, 3%^= ((S'^~^)-7rH(t), s^^(t)) and s^^^j^ is a best response of player 7r_R(i) in S'j^^. For any 

i = I, . . . , n, the last best response performed by player i in i? is the lastij(i)-th best response of R, leading 
from state 5'iastH(i)-i state 5'iastR(i)^ When clear from the context, we will drop the index R from the 
notation, writing 5*% tt and last(i) instead of S}^, ttr and last/j(i), respectively. 

Definition 1 (T-Minimum Liveness Condition). Given any T > n, a best response dynamics satisfies 
the T-Minimum Liveness Condition if it can be decomposed into a sequence of consecutive T-coverings. 
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In Section 13.21 we show that (for the general asymmetric case) under such a condition the quaUty of the 
reached state can be very bad even considering T = 0{n) (see Corollary [1]): It is worth noticing that in the 
considered congestion game, only players perform a lot of best responses {y/n best responses) in each 
covering, while the remaining n — \fn players perform only one best response every T-covering. The idea 
here is that there is a sort of unfairness in the dynamics, given by the fact that the players do not have the 
same chances of performing best responses. 

In order to quantify the impact of fairness on best response dynamics, we need an additional parameter 
/? and we define the /3-bounded T-covering as a T-covering in which every player performs at most /3 best 
responses. 

Definition 2 ((T, /3)-Fairness Condition). Given any /? > 1, a dynamics satisfies the (T, /S)- Fairness 
Condition if it can be decomposed into a sequence of consecutive j3-hounded T-coverings. 

Notice that /? is a sort of (un)fairness index: If /3 is constant, it means that every player plays at most a 
constant number of times in each T-covering and therefore the dynamics can be considered fair. 

In order to prove our upper bound results, we will focus our attention on particular congestion games to 
which any linear congestion game is best-response reducible. The following definition formally states such a 
notion of reduction. 

Definition 3 (Best-Response Reduction). A congestion game Q is Best-Response reducible to a conges- 
tion game Q' with the same set of players if there exists an injective function g mapping any strategy profile 
S of Q to a strategy profile g{S) of G' such that 

(i) the social cost of S in Q is equal to the one of g[S) in Q' and for any i — 1, . . . ,ri the cost of player i in 
S is equal to the one of player i in Q' 

(ii) there exists, in the game Q , a best response of a player, say i, in S leading to state S' if and only if there 
exists, in the game Q' , a best response of player i in g{S) leading to state g{S'). 

3 The Impact of Fairness 

In this section we first (in Subsection 13. II) provide an upper bound to the approximation ratio of the states 
reached after a dynamics satisfying the (T, /3)-Minimum Liveness Condition, starting from an arbitrary state 
and composed by a number of best responses polynomial in n. Finally (in Subsection 13. 2p . we provide an 
almost matching lower bound holding for dynamics satisfying the same conditions. 

3.1 Upper bound 

All the results hold for linear congestion games having delay functions fe{x) = a^x -\- be with Og, 6e > for 
every e € E. Since our bounds are given as a function of the number of players, the following proposition 
allows us to focus on congestion games with identical delay functions f{x) — x. 

Proposition 1. Any linear congestion game is best-response reducible to a congestion game having the same 
set of players and identical delay functions f{x) — x. 

Proof. Given a congestion game Q having delay functions fdx) — OeX + bf, with integer coefhcient Oe, 6e > 0, 
it is possible to reduce it to a congestion game Q' , having the same set of players and identical delay functions 
f{x) = X in the following way. For each resource e inQ, we include in Q' a set A,, of resources and n sets 
_Bg, . . . , i?", each containing be resources; moreover, given any strategy Si £ Si \n Q , i = 1, . . . , n, we build 
a corresponding strategy € (in Q') by including in s,^, for each e G Si, all the resources in the sets Ae 
and The function g is such that any strategy profile (si, . . . , s„) of G is mapped to the strategy profile 
{s'l, . . . , s^) of G' . If Oe and be are not integers we can perform a similar reduction by exploiting a simple 
scaling argument. □ 
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Since the dynamics satisfies the (T, /3)-Fairness Condition, we can decompose it into k /3-bounded T- 
coverings Ri, . . . , Rk- 

Consider a generic /3-bounded T-covering R ~ (5*°, . . . , S^y In the following we will often consider the 
immediate costs (or delays) of players during R, that is the cost C7r(t)(S'*) right after the best response of 
player Tr{t), for i = 1, . . . , T. 

Given an optimal strategy profile 5*, since the t-th player 7r(i) performing a best response, before doing 
it, can always select the strategy she would use in S**, her immediate cost can be suitably upper bounded as 

By extending and strengthening the technique of [9110) . we are able to prove that the best response 
dynamics satisfying the (T, /3)-Fairness Condition fast converges to states approximating the social optimum 
by a factor 0(/3). It is worth noticing that, by exploiting the technique of |9|10j . only a much worse bound of 
0{[3^) could be proved. In order to obtain an 0{f3) bound, we need to develop a different and more involved 
technique, in which also the functions p and H, introduced in |9I10| . have to be redefined: roughly speaking, 
they now must take into account only the last move in R of each player, whereas in [9110] they was counting 
for all the moves in R. 

We now introduce functions p and H, defined over the set of all the possible /3-bounded T-coverings: 

- Let p{R) ^ ELi Eee.* + 1); 

- let i/(i?)=E:iiEee<'^e(50). 

Notice that p{R) is an upper bound to the sum over all the players of the cost that she would experience 
on her optimal strategy s* just before her last move in R, whereas H{R) represents the sum over all the 
players of the delay on the moving player's optimal strategy s* in the initial state S*" of R. Moreover, since 
players perform best responses, Ci(<S'^'^^*"^*^) < piR), i-e. p{R) is an upper bound to the sum of the 

immediate costs over the last moves of every players. 

The upper bound proof is structured as follows. Lemma[T]relates the social cost of the final state S'^ of a /3- 
bounded T-covering R with p{R), by showing that C{S'^) < 2p{R). By exploiting Lemmata[2]and[3l providing 
an upper (resp. lower) bound to H{R) in terms of p{R) (resp. p{R)), Lemma 2] proves that rapidly 

decreases between two consecutive /3-bounded T-coverings R and i?, showing that = O • 

the proof of Theorem 1, after deriving a trivial upper bound equal to 0{n) for p{Ri), Lemma|4]is applied 
to all the k — 1 couples of consecutive /3-bounded T-coverings of the considered dynamics satisfying the 
(T, /3)-Fairness Condition. 

The following lemmata show that the social cost at the end of any /3-bounded T-covering R is at most 
2p{R), and that fast decreases between two consecutive /3-bounded T-coverings. They can be proved by 
adapting some proofs in |9I10| so that they still hold with the new definition of p. 

Lemma 1. For any f3 > 1, given a /S-bounded T-covering R, C{S^) < 2p{R). 

Proof. Since the players perform best responses, inequality ([IJ below holds. Moreover, since in R each player 
performs at least a best response, inequality © holds because for each resource e used in the state 5^ there 
must exist, for every integer j such that 1 < j < ne{S'^), a player having, just after her last best response 
in R, a delay on e at least i. Therefore, by recalling the definition of p{R), 

n 
n 

>^c,(5'"^*«) (1) 

i=l 

eeE j = l 
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Lemma 2. For any /? > 1, given a jS-bounded T -covering R ending in S"^ , ^ese ""^^^ ) ^ 2^^^. 

Proof. By applying the Cauchy-Schwarz inequality, we obtain 



eGB y eS-E eG-E 

Moreover, by Lenima[T]we have J2eeE* "'e(5'^) = C{S'^) < 2p{R); therefore, since Y^eeE' '"'li^*) = Opt, 
we obtain 

E ne{S'^)n,{S*) < ^2p(i?)0PT. (3) 

eGE 

Dividing by Opt both sides of the above inequahty, the claim follows. □ 
In Lemma[3]we are able to relate p{R) and H{R) by much strengthening the technique exploited in j9|lQj. 
Lemma 3. For any /? > 1, given a j3-hounded T-covering R, 

^<2« + 4/3 + l. 
Opt Opt 

Proof. First of all, notice that if the players performing in R best responses being improvement moves never 
select strategies used by some player in S* , i.e. if the players select only resources e such that ne{S*) — 0, 
than, by recalling the definitions of p{R) and H{R), p{R) < H{R) + Opt and the clam would easily follow 
for any /3 > 1. 

In the following our aim is that of dealing with the generic case in which players moving in R can increase 
the congestions on resources e such that n^iS*) > 0. 

For every resource e € -E, we focus on the congestion on such a resource above a "virtual" congestion 
frontier — 2[ine{S*). 

We assume that at the beginning of covering R each resource e E E has a congestion equal to 
(5o,e = max{?ie(S'°), (?e}, and we define (5o,e as the congestion of level on resource e; moreover, Aq = 
J2eeE ^o,e ■ ne{S*) IS au upper bound to H[R). We refer to as the total congestion of level 0. 

The idea is that the total congestion of level can induce on the resources a congestion (over the frontier 
ge) being the total congestion of level 1, such a congestion a total congestion of level 2, and so on. More 
formally, for any p > 1 and any e € E, we define Sp^e as the congestion of level p on resource e above the 
frontier g^; we say that a congestion 6p,e of level p on resource e is induced by an amount Xp.e of congestion 
of level p — 1 if some players (say, players in Np,e) moving on e can cause such a congestion of level p on e 
because they are experimenting a delay on the resources of their optimal strategies due to an amount Xp^e 
of congestion of level p. Notice that, for each move of the players in Np,e, such an amount Xp^e of congestion 
of level p — 1 can be used only once, i.e. it cannot be used in order to induce a congestion of level p for 
other resources in i?\ {e}. In other words, Xp^e is the overall congestion of level p — 1 on the resources in the 
optimal strategies of players in A^p,e used in order to induce the congestion Sp^e of level p on resource e. 

For any p, the total congestion of level p is defined as Ap — J^eeE ^p,e ' '^e('S'*). Moreover, for any p > 1, 
we have that X^esE ^p-e — f^^p-i because each player can move at most 13 times in R and therefore the total 
congestion of level p — I can be used at most /3 times in order to induce the total congestion of level p. 

It is worth noticing that p{R) < X)^o '^p ~^ Opt, because X]^o ^p.^ upper bound on the congestion 
of resource e during the whole covering R: 



n 



= 1 eGs* 
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^EE E^.-e + i =E -'^(^*) E'^p^^ + i 

oc oo 
eGBp=0 eS-E p=0 



In the following, we upper bound J2'^=o ^p- 

Ap = Y Sp^e ■ ne{S*) 







< 






eeE 


< 


E: 






< 


Ap- 



(4) 



(5) 



where inequality |4] holds because Sp^e is the portion of congestion on resource e above the frontier due to 
some moving players having on the resources of their optimal strategy a delay equal to Xp^e, and inequality 
[5] holds because each player can move at most f3 times in R and therefore the total congestion of level p — 1 
can be used at most /? times in order to induce the total congestion of level p. 
We thus obtain that, for any p > 0, Ap < ^ and J2'^=q — ^'^o- 

Since Ao = EeGB max{ne(50), 2/3^^(5*)} • UeiS*) < H{R) + 2/30pt and p{R) < E^o + Opt < 
2Aa + Opt, we finally obtain the claim. □ 

By combining Lemmata [2] and [31 it is possible to prove the following lemma showing that fast 
decreases between two consecutive coverings. 



Lemma 4. For any /3 > I, given two consecutive f3 -bounded T -coverings R and R, < 2y 2^^^ +4/3 + 1. 

Proof. Recall that the initial state S'^ of walk R coincides with the final state of walk R, and that H{R) = 
Y^J^i^f,^s* ('^e(<5'")). By inverting the summation in the definition of H{R), we obtain that H{R) = 

X^eGB "e(<5'°)ne(S'*). Therefore, by LemmajH < \j 2^^- Hence, by combining such an inequality with 

Lemma [31 the claim follows. □ 

By applying Lemma SI to all the couples of consecutive /3-bounded T-coverings, we are now able to prove 
the following theorem. 



Theorem 1. For any linear congestion game Q, any best response dynamics satisfying the {T, Fairness 

C(S) 
Opt 



Condition converges from any initial state to a state S such that — 0{(3) in at most T [log log n] best 



responses. 



Proof. Given a bast response dynamics satisfying the (T, /3)-Fairness Condition, let Ri,...,Rk be the k 
/3-bounded T-coverings in which it can be decomposed. By applying Lemma HI to all the pairs of consecutive 
/3-bounded T-coverings Rj and Rj+i, for any j = 1, . . . , fc — 1 we obtain that for any a > 1 



Opt V Opt 

By combining all the above inequalities for j — 1, . . . ,fc — 1 and by performing some basic algebraic 
manipulations, we obtain that ^4^^ = O i ^ \/ ^4^^ + /3 ) . Thus, by Lemma [H the cost of the final state 
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S of walk Rk is 



C{S) 
Opt 




By the definition of p{R), since Y^eeE''^e{S*) < Z^eeB '^e('5'*) = Opt, for any possible /3-bounded T- 
covering R it holds that 

n n 

piR) = E E (ne(5^-*«-^) + 1) < E E + 1) 

n 

= (n + 1) ^ \s*\ < (n + 1) 5] ne(S*) < (n + 1)0pt. 

i=l eeE 

Therefore, £^ < n + 1 and we obtain 



C{S) 
Opt 




It is worth noticing that k = log log n /3-bounded T-coverings are sufficient in order to obtain -^^^ = 0(/3). 
Since every ^-bounded T-covering, by its definition, contains at most T best responses, the claim follows. □ 



3.2 Lower bound 

Theorem 2. For any e > 0, there exist a linear congestion game Q and an initial state S*" such that, for any 
P = 0{n 1=82 there exists a best response dynamics starting from 5° and satisfying the {T, /S)- Fairness 
Condition such that for a number of best responses exponential in n the cost of the reached states is always 
l2(/3i-^ • Opt). 

Proof. We construct a congestion game Q for which there is an initial state Sq and an exponentially long 
sequence of best responses starting from this state. In every round, each player moves at most /3 times. For 
technical reasons, we assume /3 is even and /? > 10. 

We present the construction in two stages. In a first step we construct a game Q' that has the desired 
properties only if we assume players change their strategies even if they have exactly the same delay. In a 
second step, extend Q' towards a game G in which those strategy changes are improving best responses. 

We define fi = (j^^)'^~2»+i . The players and the resources are divided into L + 1 levels, 0,.. .,L. Each 
level i is divided into /3* blocks (i, 1), . . . , (i, /?*). Each block {i,j) consists of 3/i players, ^ main resources, 
and 2/jlog/3 address resources. We group the players into fi triplets and denote them by Pij,k, Qi,j,k, and 
Ri,j,k with 1 < fc < /j. The general idea is the following. In the optimal solution all players of level i only 
use the main resources of level i. The resources are evenly distributed among the triplets. However, in the 
sequence that we construct, the P player of a block of level i always allocate main resources of a block in 
level i + 1 such that these players cannot choose the main resources of level i + 1. We say the P players of a 
block {i,j) block the players of a block {i + l.j'). In each round, each block blocks blocks of the next level. 
The purpose of the Q and R players and the address resources is to make sure that all P players block the 
same block of the next level. 

We now describe the strategics and resources in detail. Each player has + 1 strategies that we denote 
by so(')j • • • I •s^(')- Only the sq strategies of players of a block contain main resources from level i. 

Moreover, the main resources of block are equally distributed among the triplets. The delay function 
of the main resources is f{x) = x. 

so{Pi,j,k) = so{Qi,j,k) = soiRi,j,k) = {p, m(fc-i) p ■■■,Pi ^} 
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The Q and R players use the address resources to binary encode a when playing s^. The Q players are 
for the even numbers the R players for the odd numbers. Thus, for every 1 < a < /3 the Q and R players 
have a strategy. 

Sa{Qi,j,k) = {<lij,k,i I for all 1 < I < log(^ - 1) with bit i of a is 0} U {gi^. | for all 1 < I < 
log(/3 — 1) with bit i of a is 1} 

Sa{Rr,3,k) = {rlj,k,i I for all! < / < log(/3 - 1) with bit « of a is 0} U {r^^^- | for all 1 < I < 
log(/3 — 1) with bit i of a is 1} 

The delay function for each of the r and q resources is f{x) — ^^r^x. 

The P players allocate main resources of the next level (« + 1) in their strategies Sa- More precisely, the 
players of block (z, j) allocate all main resources that are part of the sq strategy of block + — l)0 + a). 
Additionally, the strategy contain the address resources that we used above to binary encode a number. 
Note that the use of the address resources is inverted compared to the Q and R players. 

For all even a G {2, . . . , 

Sa{Pi,3,k) = {ti,j,k,a,Pi+l.,(]-l)l3+a,\,---,Pi+l,U-l)li+a,-^} ^ {l],3,k,l I for all 1 < I < log(/3- 

1) with bit Z of a is 0} U {qlj^k,i I for all 1 < / < log(^ - 1) with bit Z of a is 1} 

For aU odd a € {1, . . . , /3 - 1}: 
Soc{Pi,j,k) = {tiJ,ft,a,ft+i,(j-i)/3+c«,l, • • • ,ft+i,0-i)/3+a,-^} U {r,^^^_j._; | for all 1 < I < log(^ - 

1) with bit Z of a is 0} U {r?^- a, ^ | for all 1 < Z < log(/3 - 1) with bit Z of a is 1} 

The resource tij^k,a has constant delay of '"^{v i*''* if is odd and delay of '"^i+i"'"'' otherwise. 
Before we describe the sequence of best responses let us observe some useful properties of the construction. 
Since there are fi player in every block of level i, we can conclude the following. 

Claim. If all fi P players of a block {i, j) play strategy for some a, then each resource of block + — 
l)/3 + a) has delay of at least fi. 

We now observe that there a useful sequences of strategy changes in which the delay of the moving player 
stays the same. 

Proposition 2. For an even a, consider a strategy profile in which the players Pij^k and Qi,j,k of a group 
play strategy Sa, and the players Ri,j,k play Sa+i- If the players Pi,j,k change to Sa+i in an order 
increasing in k, then each player has delay of ^ before and after his strategy change. Additionally, 

when a player moves, his strategies Sa' with a' ^ {a, a + 1} have delay of more ^ . 

Proof. Consider a player Pi,j^k before he moves to Sq+i- Then fc — 1 many P players have already changed to 
Soi+i- Thus, the p resources of block [i + 1, {j — l)j5 + a) are used by fi — {k — 1) other players and, therefore, 
induce total delay of ^^{fi — k+1). Adding "^j^'i+i for the t resource and (log(/3 - l))0r for the address 
resources gives the desired result for the strategy Sa- 

If he changes to Sa+i, there will be k players on the the p resources of block (i + 1, (j — + a + 1) 
inducing total delay of -^^k. Again, with the t resource and the address resources that conveniently adds 

up to 

For the strategies s^' with a' ^ {a, a + 1}, first consider the case of a' being even. Observe that there 
is a least one bit in which a' differs from the even number in {a, a +1}. Therefore, at least one of the q 
resources in strategy Sa' of the P player is also used by the Q player of the same triplet and, thus, induces 
delay of 2^^. The delay of the address resources adds up to at least "^'i+i^ ^ and taking into account the 
delay of the t resource the claim follows. The case of an odd a' is analogous. □ 
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Claim. The delay of a blocked strategy sq is greater than a state in the sequence described in Proposition [2J 

• m r . Vfifi log /3 

' WTiJ'^~^ — ^^^^^^ — ■ 

We are now ready to describe the sequence of strategy changes. We start in a state 5*0 in which every 
player plays his strategy s^. The sequence is recursively defined as described by Algorithm [T] which is started 
by run(l, 1). Note that every player moves at most /3 time is this sequence. We call this sequence a run of 
Q' . In a nutshell: Whenever a block is blocked, the P-players iteratively block the sq strategies of certain /3 
blocks of the next level. The Q and R players make sure, that the P players block the sq strategies of the 
same block, i.e, all P players change to their strategy Sq for some a. 



Algorithm 1 Recursive procedure run(i,j') 
1: for a = 1 to /3 do 
2: if Q is even then 
3: for all 1 < fc < fi do 

4: Qi,j,k changes to Sa{Qi,j,k) 

5: end for 

6: for k — 1 to fi do 

7: Pij,fe changes to Sa{Pi.j.k) 

8: end for 

9: else 

10: for all 1 < A: < /i do 

11: Ri,j,k changes to Sa{Ri,j,k) 

12: end for 

13: for k — fi to 1 do 

14: Pi^j^k changes to Sa{Pi,j,k) 

15: end for 

16: end if 

17: run(i + 1, (j - l)/3 + a) 
18: end for 



In the optimal solution, all players play their sq strategy which yields social cost of 3m (L + 1). During 
our sequence the cost of level L is — (k^f^)^"^- Therefore, given any e > 0, it suffices to consider an 

e' < e and to choose L — ~ log e' so that the approximation ratio of all the states of the considered dynamics 
is always at least — — , ^ , — ■, — i— = Q(f3^~'^). 

3(l-loge') logi"' 13 ^' ' 

Note that the in the above sequence, the delay of a player that changes his strategy does not change. 
In order to complete the proof, we now turn the game into one in which these strategy changes are best 
responses. Let us give an overview of the modifications first. We combine above game Q' with a game Gn that 
has the property that there is an initial state and every sequence of best responses starting from that state 
is exponentially long. We essentially use the game that was presented in |14J and modify its delay functions 
to make them linear. The best response sequence of G„ resembles the recursive run of n programs in which 
program i is executed 2"^* times. We use the constants M > 20(5" and S ^ |A^|(/5 + 1) as scaling factors, 
where N denotes the set of players of G' ■ Note that they are independent of m which allows us to arbitrarily 
scale the cost of G". 

To each strategy of G' , we add a unique trigger resource with the delay function f{x) = x. Note that 
this does not change the preferences of the players. We extend the game with additional trigger players. 
Whenever a player of Q' is supposed to change to a strategy s, we make sure that the trigger resources in 
all his other strategies are used by trigger players. Therefore, his strategy change decreases his delay by 1. 

Let r denote the number of strategy changes of a run of Q' , We extend the game with r trigger players 
for each strategy of each player of Q' . Each trigger player has two strategies, trigger and wait. If he plays 
trigger, he allocates the trigger resource. His wait strategy is contains a resource that is also contained in 
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the strategies of a player of game G„. We have one reset player for each trigger player. He has a strategy 
wait which is also connected to a player of game Gn and a strategy reset. If he plays reset, the corresponding 
trigger player has an incentive to change back to wait. 

We are now ready to describe Q in detail. For each i ^ N and < a < /?, we add a resource Vi^a to 
strategy Saii)- It has the delay function f{x) = x. For each i G N and < a < /3, we add a player Ti^a- His 
strategies are described in Figured] We add n players Main^ as described in Figure 2] and 8n players Block^ 
as described in Figure [31 Note that resources of the wait strategies of the trigger and reset players are part 
of strategy (9) of player Maini. 

We now describe the best response sequence of Q. Algorithm [2] describes a recursive run of the players of 
G' , the trigger players, and reset players. Note that in such a run, every player moves at most /3 times. We 
now combine such runs with the best response dynamics of G„. Note that the best response sequence of G„ 
is essentially unique as described in 14!. We divide this exponentially long sequence into rounds. We define 
the beginning of a round as the state before player Maini moves from strategy (8) to (9). This divides the 
best response sequence into exponentially many rounds in which every player of G„ moves at most 9 times. 
At the beginning of a round, we interrupt this sequences and let the remaining players move. We first let all 
trigger and reset players play their best response. This ensures that they change to wait. We then let Maini 
move from strategy (8) to (9). We then start one run as described in Algorithmic] Finally, we let the players 
of Gn play their best response in decreasing order of their indices with player Blockf as the last player. He 
is also the only one that changes his strategy. With his move, we continue the best response sequence of G„ 
for the remainder of the round. 

This yields a exponentially long sequence of best responses that satisfies the (T, /3)-Fairness Condition. 



Strategies ofTi^a.d 


Resources 


Delay function f{x) 


trigger 




X 

2x 


wait 


Wi^a,d 


2x 



Fig. 1. Definition of the strategies of the trigger player Ti^a,d 



Strategies of Ui^a,d 


Resources 


Delay function f{x) 


reset 


u 

Ui^a^d 


3 

2x 


wait 


Xi^a,d 


4x 



Fig. 2. Definition of the strategies of the reset player Ui^a,d 



Stratej 


pes of Block^ 


Resources 


Delay function /(a;) 


(1) 






X 
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Mx 


(2) 




Mx 



Fig. 3. Definition of the strategies of the players Block^ 
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Algorithm 2 Recursive procedure run2(i,j) 

1: for a = 1 to /3 do 

2: if a is even then 

3: for all 1 < fc < /f do 

4: for all a' ^ a do 

5: T'(Q. cliange to trigger 

6: end for 

7: Qi,i,fc changes to Sa{Qi,j,k) 

8: for all a' ^ a do 

9: '^(Qi,3,fc),a',d cliange to reset 

10: 2"(Q,^ cliange to wait 

11: end for 

12: d = d+l 

13: end for 

14: for = 1 to fi do 

15: for all a' ^ a do 

16: T(Pi^j^k),a',d change to trigger 

17: end for 

18: Pi,j,fe changes to Sa{Pi,j,k) 

19: for all a' 7^ a do 

20: ?7(p. ^ ^),a',d change to reset 

21: T(p^^ fc),c«',d change to wait 

22: end for 

23: d = d+l 

24: end for 

25: else 

26: for all 1 < k < fi do 

27: for all a' ^ « do 

28: change to trigger 

29: end for 

30: Ri,j,k changes to Sa{Ri,j,k) 
31: 

32: for all a' ^ a do 

33: ^ f.),a',d change to reset 

34: ^. fc),a',d change to wait 

35: end for 

36: end for 

37: for A: = to 1 do 

38: for all a' ^ a do 

39: T(Pi^j^k),a',d change to trigger 

40: end for 

41: Pi,3,k changes to Sa{Pi,j,k) 

42: d = d+l 

43: for all a' / a do 

44: C^(p. j_fc),a',d change to reset 

45: r(p. . f.)^a',d change to wait 

46: end for 

47: end for 

48: end if 

49: run(i + 1, {j - l)/3 + a) 
50: end for 
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Fig. 4. Definition of the strategies of the players Main;. The delay of resource is constantly 95". Strategy (9) of 
player Maini additionally contains resources Wi^a,d and Xi^a,d for alH £ A*' and < a < /3 and 1 < d < r. 



□ 

By choosing j3 — ^fn and considering a simplified version of the proof giving the above lower bound, 
it is possible to prove the following corollary. In particular, it shows that even in the case of best response 
dynamics verifying an 0(n)-Minimum Liveness Condition, the speed of convergence to efficient states is very 
slow; such a fact implies that the T-Minimum Liveness condition cannot precisely characterize the speed of 
convergence to efficient states because it does not capture the notion of fairness in best response dynamics. 

Corollary 1. There exist a linear congestion game Q, an initial state S'^ and a best response dynamics start- 
ing from and satisfying the 0{n)-Minimum Liveness Condition such that for a number of best responses 

exponential in n the cost of the reached states is always fl (yj^^ ' Opt^ . 

Proof. The linear congestion game proving the corollary can be easily obtained by simplifying the construc- 
tion exploited in the proof of Theorem [5] such that /3 — y/n and the number of levels is 2 (i.e., L = 1). In 
such a way, ^^(i^^) players belong to level and the remaining players to level 1. 
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In the optimal solution, all players play their sq strategy which yields social cost of 6m. During our 
sequence the cost of level 1 is m/o — (^jj^ j ^ = fl {^-^^^ ■ 



□ 



4 Symmetric Congestion Games 

In this Section we show that in the symmetric case the unfairness in best response dynamics does not 
affect the speed of convergence to efficient states. In particular, we are able to show that, for any /3, after 
T [log log n] best responses an efficient state is always reached. To this aim, in the following we consider best 
response dynamics satisfying only the T-Minimum Liveness Condition, i.e. decomposable into k T-coverings 
Ri, . . . ,Rk- 

All the results hold for linear congestion games having delay functions fe{x) — + be with ae,be > 
for every e € E. Analogously to the asymmetric case, since our bounds are given as a function of the number 
of players, the following proposition allows us to focus on congestion games with identical delay functions 
f{x) = X. 

Proposition 3. Any symmetric linear congestion game is best-response reducible to a symmetric congestion 
game having the same set of players and identical delay functions f{x) — x. 

Proof. Given a symmetric congestion game Q having delay functions fe{x) = agX + be with integer coefficient 
fle; ^ 0; it is possible to perform a best-response reduction to a symmetric congestion game Q' , having the 
same set of players and identical delay functions f{x) = a; in the following way. For each resource e in Q, we 
include in Q' a set of resources and n sets . . . , B", each containing b^ resources; moreover, given 
any strategy set Sj € Z'^ in CJ, i = 1, . . . , n, we build n corresponding strategies s[ s[ 21 ■ ■ ■ t s[ ^ G S'^ {in G') 
by including in s'^ p for each e € Si, all the resources in the sets A^ and B^. If Oe and be are not integers we 
can perform a similar reduction by exploiting a simple scaling argument. 

The function g is such that any strategy profile S = (si, . . . , s„) of ^ is mapped to the strategy profile 
•S" = (s'l ij ^2 2 ■ • • J n) of Q' . In such a way, we make sure that, if player j is using resource e in S, in S' 
only player j is using the strategy associated with the set B^. In fact, the reduction guarantees that at every 
best response of the dynamics, for each resource e, at most one player uses the set of resources -Bf , for every 
k. 

If Oe and be are not integers we can perform a similar reduction by exploiting a simple scaling argument. 

□ 

Consider a generic T-covering R = (^S^ , . . . , S^) . Given an optimal strategy profile S* , since the 
t-th player Tr{t) performing a best response, before doing it, can always select any strategy s*, for 
i = 1 . . .n, of S*, her immediate cost c^(t)(S'*) can be upper bounded as ^ Y^^=i Sees* /e("-e(<S'*~^) + 1) = 
n SeGB ''^e{S*) fei^eiS*^^) + !)• In Order to prove our upper bound result, we introduce the following func- 
tion: 

- m) = i Etl EeeE ne(5*)/eK(5'-*W-l) + 1). 

Notice that r{R) is an upper bound to the sum of the immediate cost over the last moves of every players, 
i.e., r{R) > X]r=i Ci(S''''*'*^*^*''). Therefore, by exploiting the same arguments used in the proof of Lemma[l] 
it is possible to prove the following lemma relating the social cost C{S'^) at the end of R with r{R). 

Lemma 5. Given any T-covering R, C{S'^) < 2r{R). 

Moreover, given any T-covering i?, we can relate the social cost C{S^) of the final state of R with the 
cost C(5°) of its initial state. 



Lemma 6. Given any T-covering R, '^^'^^ < (2 -|- 2\/2)\/ ^^q^^^ 
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Proof. 



Opt Opt 



2 



Opt 



eeE 1=1 eGB 



^2 + -7^EE-e(^*K(5'-*«-^] 



= 1 eeE 



^ 2 + -tI- E jE «?(^*) jE «?(^'-'«'^) (7) 



nOPT 

i=l y eGB V eeE 



n 



Opt 



E%/0PTy^c(5i^^*w-i) 



j=i 



^2 + ;;;^E\/2<^(5'-*^^'-^) (8) 

<2 + — =Ev/2^ (9) 
nyOPT ^ 



where inequality ^ follows from Lemma [51 inequality ([7]) is due to the application of the Cauchy-Schwarz 
inequality, inequality ^ holds because C{S^^^^^'-^~^) < 2(p{S^^*-^^^~^), inequality © holds because the poten- 
tial function can only decrease at each best response and inequality PU)) holds because <?(5°) < C(S'°). □ 

By applying LemmajSlto all the couples of consecutive T-coverings, we are now able to prove the following 
theorem. 

Theorem 3. Given a linear symmetric congestion game, any best response dynamics satisfying the T- 
Minimum Liveness Condition converges from any initial state to a state S such that = in at 

most T [In In n~\ best responses. 

Proof. We decompose the dynamics into k consecutive T-coverings Ri, . . . , Rk- Recall that, for any j = 
l,...,k, Sj and Sj are the initial and the final state of covering Rj, respectively. Notice that for any 



J — 1 , . . . , /b 1, Sj — S^_^-^ and S — 5*^ 



By applying Lemma [Blfc — 1 times, we obtain 

Opt \ Opt 

C( ) 

Since, at it is easy to check, -^j^ < n, the claim follows by choosing k ~ [In Inn]. □ 
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5 Conclusion and Extensions 



In this work we have completely characterized how, in linear congestion games, the frequency with which each 
player participates in the game dynamics affects the possibility of reaching states with an approximation 
ratio within a constant factor from the price of anarchy, within a polynomially bounded number of best 
responses. We have shown that, while in the asymmetric setting the fairness among players is a necessary 
and sufficient condition for guaranteeing a fast convergence to efficient states, in the symmetric one the 
game always converges to an efficient state after a polynomial number of best responses, regardless of the 
frequency each player moves with. 

It is worth to note that our techniques provide a much faster convergence to efficient states with respect 
to the previous result in the literature. In particular, in the symmetric setting. Theorem [3] shows that best 
response dynamics leads to efficient states much faster than how e-Nash dynamics (i.e., sequences of moves 
reducing the cost of a player by at least a factor of e) leads to e-Nash equilibria [3]. Furthermore, also in 
the more general asymmetric setting. Theorem [T] shows that the same holds for fair best response dynamics 
with respect to e-Nash ones [5]. 

Although we have focused on linear congestion games, all the results can be extended to the more general 
case of congestion games with polynomial delays by exploiting techniques similar to the ones used in j9|10| . 
In particular, it is possible to show that, if each player is allowed to play at least once and at most f3 times 
any T best responses, states with approximation ratio 0(/3) times the price of anarchy are reached after 
T [log log n] best responses, that such a bound is essentially tight also after exponentially many ones and 
that, in the symmetric setting, the game always converges, after T [log log n] best responses and for any 
value of /3, to states with approximation ratio order of the price of anarchy. 
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